NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Cooperative Speech Separation With a Microphone Array and Asynchronous Wearable Devices

https://doi.org/10.21437/Interspeech.2022-11025

Corey, Ryan; Mittal, Manan; Sarkar, Kanad; Singer, Andrew C. (September 2022, Proc. Interspeech 2022)

We consider the problem of separating speech from several talkers in background noise using a fixed microphone array and a set of wearable devices. Wearable devices can provide reliable information about speech from their wearers, but they typically cannot be used directly for multichannel source separation due to network delay, sample rate offsets, and relative motion. Instead, the wearable microphone signals are used to compute the speech presence probability for each talker at each time-frequency index. Those parameters, which are robust against small sample rate offsets and relative motion, are used to track the second-order statistics of the speech sources and background noise. The fixed array then separates the speech signals using an adaptive linear time-varying multichannel Wiener filter. The proposed method is demonstrated using real-room recordings from three human talkers with binaural earbud microphones and an eight-microphone tabletop array.
more » « less
Adaptive Crosstalk Cancellation and Spatialization for Dynamic Group Conversation Enhancement Using Mobile and Wearable Devices

https://doi.org/10.1109/IWAENC53105.2022.9914773

Corey, Ryan M.; Mittal, Manan; Sarkar, Kanad; Singer, Andrew C. (September 2022, 2022 International Workshop on Acoustic Signal Enhancement (IWAENC))

We propose a system to improve the intelligibility of group conversations in noisy environments, such as restaurants, by aggregating signals from the mobile and wearable devices of the participants. The proposed system uses a mobile device placed near each talker to capture a low-noise speech signal. Instead of muting inactive microphones, which can be distracting, adaptive crosstalk cancellation filters remove the speech of other users, including delayed auditory feedback of the listener’s own speech. Next, adaptive spatialization filters process the low-noise signals to generate binaural outputs that match the spatial and spectral cues at the ears of each listener. The proposed system is demonstrated using recordings of three human subjects conversing with realistic movement.
more » « less
Mechatronic Generation of Datasets for Acoustics Research

https://doi.org/10.1109/IWAENC53105.2022.9914771

Lu, Austin; Moore, Ethaniel; Nallanthighall, Arya; Sarkar, Kanad; Mittal, Manan; Corey, Ryan M.; Smaragdis, Paris; Singer, Andrew (September 2022, 2022 International Workshop on Acoustic Signal Enhancement (IWAENC))

We address the challenge of making spatial audio datasets by proposing a shared mechanized recording space that can run custom acoustic experiments: a Mechatronic Acoustic Research System (MARS). To accommodate a wide variety of experiments, we implement an extensible architecture for wireless multi-robot coordination which enables synchronized robot motion for dynamic scenes with moving speakers and microphones. Using a virtual control interface, we can remotely design automated experiments to collect large-scale audio data. This data is shown to be similar across repeated runs, demonstrating the reliability of MARS. We discuss the potential for MARS to make audio data collection accessible for researchers without dedicated acoustic research spaces.
more » « less
Utilizing Time-of-Flight LIDARs For Spatial Audio Processing

https://doi.org/10.1109/IEEECONF53345.2021.9723358

Sarkar, Kanad; Corey, Ryan M.; Singer, Andrew C. (January 2021, 55th Asilomar Conference on Signals, Systems and Computers, ACSSC 2021)

Full Text Available

Search for: All records